Sequence Listing could not be accepted due to errors. 
See attached Validation Report. 

If you need help, call the Patent Electronic Business Center at (866) 
217-9197 (toll free). 
Reviewer: Durreshwar Anjum 

Timestamp: [year=2008; month=5; day=30; hr=12; min=17; sec=48; ms=565;] 



*********************************************************** 

Reviewer Comments : 
<210> 30 
<211> 36 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> primer 



<400> 30 

gcttatggcg cgcctcagga ctggagcttg ctccgc 

36n 

21 



Please delete the extra "n" that appears beside the total number of bases 
(36), and also remove the end-of-file text (21) that appears below the bases, 
as shown in the attachment above. 

*********************************************** 



Validated By CRFValidator v 1.0.3 
Application No: 10590956 Version No: 1.0 



Input Set: 



Output Set: 



Started: 2008-05-09 16:08:23.466 

Finished: 2008-05-09 16:08:25.990 

Elapsed: 0 hr(s) 0 min(s) 2 sec(s) 524 ms 

Total Warnings: 24 

Total Errors: 7 

No. of SeqIDs Defined: 30 

Actual SeqID Count: 30 

Error code Error Description 

W 213 Artificial or Unknown found in <213> in SEQ ID (27) 

W 213 Artificial or Unknown found in <213> in SEQ ID (28) 

W 213 Artificial or Unknown found in <213> in SEQ ID (29) 

W 213 Artificial or Unknown found in <213> in SEQ ID (30) 

E 342 'n' position not defined found at POS: 39 SEQID (30) 

E 254 The total number of bases conflicts with running total Input: 0, Calculated: 39 SEQID (30) 

E 323 Invalid/missing amino acid numbering SEQID (30) at Protein (1) 

E 323 Invalid/missing amino acid numbering SEQID (30) POS (1) 

E 323 Invalid/missing amino acid numbering SEQID (30) at Protein (5) 

E 323 Invalid/missing amino acid numbering SEQID (30) at Protein (10) 

E 253 The number of bases differs from <211> Input: 36 Calculated: 39 
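
The length errors listed above (E 253, E 254) report a mismatch between the declared <211> value and the number of residue letters found under <400>. The following is a minimal Python sketch of that kind of check. It is not CRFValidator's implementation; the function names count_bases and check_length are hypothetical, and the sketch assumes that every alphabetic character on a sequence line counts as a base while digits are running totals.

```python
import re


def count_bases(seq_lines):
    """Count alphabetic characters on the sequence lines.

    Assumption: whitespace and digits (running totals) are not bases;
    every letter, including a stray 'n', is counted as one base.
    """
    return sum(len(re.findall(r"[A-Za-z]", line)) for line in seq_lines)


def check_length(declared, seq_lines):
    """Return (matches, calculated) for a declared <211> length."""
    calculated = count_bases(seq_lines)
    return calculated == declared, calculated


# SEQ ID 30 as it should appear: 36 bases followed by the running total.
clean = ["gcttatggcg cgcctcagga ctggagcttg ctccgc 36"]

# SEQ ID 30 with the stray "n" beside the total, as in the reviewer comment.
faulty = ["gcttatggcg cgcctcagga ctggagcttg ctccgc", "36n"]

print(check_length(36, clean))
print(check_length(36, faulty))
```

The sketch only illustrates how one extra letter breaks the agreement with <211>; it does not attempt to reproduce the validator's calculated value of 39.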



SEQUENCE LISTING 



<110> Genencor International, Inc. 
Bower, Benjamin 
Mitchinson, Colin 
Larenas, Edmund 

<120> Cellulase Fusion Protein and Heterologous Cellulase Fusion 
Construct Encoding the Same 

<130> GC832-PCT 

<140> 10590956 
<141> 2008-05-09 

<150> PCT/US2005/010242 
<151> 2005-03-25 

<150> US 60/556,711 
<151> 2004-03-25 

<160> 30 

<170> PatentIn version 3.2 

<210> 1 
<211> 1570 
<212> DNA 

<213> Trichoderma reesei 
<400> 1 

atgtatcgga agttggccgt catctcggcc ttcttggcca cagctcgtgc tcagtcggcc 60 

tgcactctcc aatcggagac tcacccgcct ctgacatggc agaaatgctc gtctggtggc 120 

acttgcactc aacagacagg ctccgtggtc atcgacgcca actggcgctg gactcacgct 180 

acgaacagca gcacgaactg ctacgatggc aacacttgga gctcgaccct atgtcctgac 240 

aacgagacct gcgcgaagaa ctgctgtctg gacggtgccg cctacgcgtc cacgtacgga 300 

gttaccacga gcggtaacag cctctccatt ggctttgtca cccagtctgc gcagaagaac 360 

gttggcgctc gcctttacct tatggcgagc gacacgacct accaggaatt caccctgctt 420 

ggcaacgagt tctctttcga tgttgatgtt tcgcagctgc cgtaagtgac ttaccatgaa 480 

cccctgacgt atcttcttgt gggctcccag ctgactggcc aatttaaggt gcggcttgaa 540 

cggagctctc tacttcgtgt ccatggacgc ggatggtggc gtgagcaagt atcccaccaa 600 

caccgctggc gccaagtacg gcacggggta ctgtgacagc cagtgtcccc gcgatctgaa 660 

gttcatcaat ggccaggcca acgttgaggg ctgggagccg tcatccaaca acgcaaacac 720 

gggcattgga ggacacggaa gctgctgctc tgagatggat atctgggagg ccaactccat 780 

ctccgaggct cttacccccc acccttgcac gactgtcggc caggagatct gcgagggtga 840 

tgggtgcggc ggaacttact ccgataacag atatggcggc acttgcgatc ccgatggctg 900 

cgactggaac ccataccgcc tgggcaacac cagcttctac ggccctggct caagctttac 960 

cctcgatacc accaagaaat tgaccgttgt cacccagttc gagacgtcgg gtgccatcaa 1020 

ccgatactat gtccagaatg gcgtcacttt ccagcagccc aacgccgagc ttggtagtta 1080 

ctctggcaac gagctcaacg atgattactg cacagctgag gaggcagaat tcggcggatc 1140 

ctctttctca gacaagggcg gcctgactca gttcaagaag gctacctctg gcggcatggt 1200 

tctggtcatg agtctgtggg atgatgtgag tttgatggac aaacatgcgc gttgacaaag 1260 

agtcaagcag ctgactgaga tgttacagta ctacgccaac atgctgtggc tggactccac 1320 

ctacccgaca aacgagacct cctccacacc cggtgccgtg cgcggaagct gctccaccag 1380 

ctccggtgtc cctgctcagg tcgaatctca gtctcccaac gccaaggtca ccttctccaa 1440 

catcaagttc ggacccattg gcagcaccgg caaccctagc ggcggcaacc ctcccggcgg 1500 

aaacccgcct ggcaccacca ccacccgccg cccagccact accactggaa gctctcccgg 1560 



acctactagt 1570 



<210> 2 

<211> 51 
<212> DNA 

<213> Trichoderma reesei 
<400> 2 

atgtatcgga agttggccgt catctcggcc ttcttggcca cagctcgtgc t 51 

<210> 3 
<211> 1438 
<212> DNA 

<213> Trichoderma reesei 
<400> 3 

cagtcggcct gcactctcca atcggagact cacccgcctc tgacatggca gaaatgctcg 60 

tctggtggca cttgcactca acagacaggc tccgtggtca tcgacgccaa ctggcgctgg 120 

actcacgcta cgaacagcag cacgaactgc tacgatggca acacttggag ctcgacccta 180 

tgtcctgaca acgagacctg cgcgaagaac tgctgtctgg acggtgccgc ctacgcgtcc 240 

acgtacggag ttaccacgag cggtaacagc ctctccattg gctttgtcac ccagtctgcg 300 

cagaagaacg ttggcgctcg cctttacctt atggcgagcg acacgaccta ccaggaattc 360 

accctgcttg gcaacgagtt ctctttcgat gttgatgttt cgcagctgcc gtaagtgact 420 

taccatgaac ccctgacgta tcttcttgtg ggctcccagc tgactggcca atttaaggtg 480 

cggcttgaac ggagctctct acttcgtgtc catggacgcg gatggtggcg tgagcaagta 540 

tcccaccaac accgctggcg ccaagtacgg cacggggtac tgtgacagcc agtgtccccg 600 

cgatctgaag ttcatcaatg gccaggccaa cgttgagggc tgggagccgt catccaacaa 660 

cgcaaacacg ggcattggag gacacggaag ctgctgctct gagatggata tctgggaggc 720 

caactccatc tccgaggctc ttacccccca cccttgcacg actgtcggcc aggagatctg 780 

cgagggtgat gggtgcggcg gaacttactc cgataacaga tatggcggca cttgcgatcc 840 

cgatggctgc gactggaacc cataccgcct gggcaacacc agcttctacg gccctggctc 900 

aagctttacc ctcgatacca ccaagaaatt gaccgttgtc acccagttcg agacgtcggg 960 

tgccatcaac cgatactatg tccagaatgg cgtcactttc cagcagccca acgccgagct 1020 

tggtagttac tctggcaacg agctcaacga tgattactgc acagctgagg aggcagaatt 1080 

cggcggatcc tctttctcag acaagggcgg cctgactcag ttcaagaagg ctacctctgg 1140 

cggcatggtt ctggtcatga gtctgtggga tgatgtgagt ttgatggaca aacatgcgcg 1200 

ttgacaaaga gtcaagcagc tgactgagat gttacagtac tacgccaaca tgctgtggct 1260 

ggactccacc tacccgacaa acgagacctc ctccacaccc ggtgccgtgc gcggaagctg 1320 

ctccaccagc tccggtgtcc ctgctcaggt cgaatctcag tctcccaacg ccaaggtcac 1380 

cttctccaac atcaagttcg gacccattgg cagcaccggc aaccctagcg gcggcaac 1438 

<210> 4 
<211> 81 
<212> DNA 

<213> Trichoderma reesei 
<400> 4 

cctcccggcg gaaacccgcc tggcaccacc accacccgcc gcccagccac taccactgga 60 
agctctcccg gacctactag t 81 

<210> 5 

<211> 480 

<212> PRT 

<213> Trichoderma reesei 



<400> 5 



Met Tyr Arg Lys Leu Ala Val Ile Ser Ala Phe Leu Ala Thr Ala Arg 
1 5 10 15 

Ala Gln Ser Ala Cys Thr Leu Gln Ser Glu Thr His Pro Pro Leu Thr 
20 25 30 

Trp Gln Lys Cys Ser Ser Gly Gly Thr Cys Thr Gln Gln Thr Gly Ser 
35 40 45 

Val Val Ile Asp Ala Asn Trp Arg Trp Thr His Ala Thr Asn Ser Ser 
50 55 60 

Thr Asn Cys Tyr Asp Gly Asn Thr Trp Ser Ser Thr Leu Cys Pro Asp 
65 70 75 80 

Asn Glu Thr Cys Ala Lys Asn Cys Cys Leu Asp Gly Ala Ala Tyr Ala 
85 90 95 

Ser Thr Tyr Gly Val Thr Thr Ser Gly Asn Ser Leu Ser Ile Gly Phe 
100 105 110 

Val Thr Gln Ser Ala Gln Lys Asn Val Gly Ala Arg Leu Tyr Leu Met 
115 120 125 

Ala Ser Asp Thr Thr Tyr Gln Glu Phe Thr Leu Leu Gly Asn Glu Phe 
130 135 140 

Ser Phe Asp Val Asp Val Ser Gln Leu Pro Cys Gly Leu Asn Gly Ala 
145 150 155 160 

Leu Tyr Phe Val Ser Met Asp Ala Asp Gly Gly Val Ser Lys Tyr Pro 
165 170 175 

Thr Asn Thr Ala Gly Ala Lys Tyr Gly Thr Gly Tyr Cys Asp Ser Gln 
180 185 190 

Cys Pro Arg Asp Leu Lys Phe Ile Asn Gly Gln Ala Asn Val Glu Gly 
195 200 205 

Trp Glu Pro Ser Ser Asn Asn Ala Asn Thr Gly Ile Gly Gly His Gly 
210 215 220 

Ser Cys Cys Ser Glu Met Asp Ile Trp Glu Ala Asn Ser Ile Ser Glu 
225 230 235 240 

Ala Leu Thr Pro His Pro Cys Thr Thr Val Gly Gln Glu Ile Cys Glu 
245 250 255 

Gly Asp Gly Cys Gly Gly Thr Tyr Ser Asp Asn Arg Tyr Gly Gly Thr 
260 265 270 

Cys Asp Pro Asp Gly Cys Asp Trp Asn Pro Tyr Arg Leu Gly Asn Thr 
275 280 285 

Ser Phe Tyr Gly Pro Gly Ser Ser Phe Thr Leu Asp Thr Thr Lys Lys 
290 295 300 

Leu Thr Val Val Thr Gln Phe Glu Thr Ser Gly Ala Ile Asn Arg Tyr 
305 310 315 320 

Tyr Val Gln Asn Gly Val Thr Phe Gln Gln Pro Asn Ala Glu Leu Gly 
325 330 335 

Ser Tyr Ser Gly Asn Glu Leu Asn Asp Asp Tyr Cys Thr Ala Glu Glu 
340 345 350 

Ala Glu Phe Gly Gly Ser Ser Phe Ser Asp Lys Gly Gly Leu Thr Gln 
355 360 365 

Phe Lys Lys Ala Thr Ser Gly Gly Met Val Leu Val Met Ser Leu Trp 
370 375 380 

Asp Asp Tyr Tyr Ala Asn Met Leu Trp Leu Asp Ser Thr Tyr Pro Thr 
385 390 395 400 

Asn Glu Thr Ser Ser Thr Pro Gly Ala Val Arg Gly Ser Cys Ser Thr 
405 410 415 

Ser Ser Gly Val Pro Ala Gln Val Glu Ser Gln Ser Pro Asn Ala Lys 
420 425 430 

Val Thr Phe Ser Asn Ile Lys Phe Gly Pro Ile Gly Ser Thr Gly Asn 
435 440 445 

Pro Ser Gly Gly Asn Pro Pro Gly Gly Asn Pro Pro Gly Thr Thr Thr 
450 455 460 

Thr Arg Arg Pro Ala Thr Thr Thr Gly Ser Ser Pro Gly Pro Thr Ser 
465 470 475 480 

<210> 6 
<211> 431 
<212> PRT 

<213> Trichoderma reesei 
<400> 6 

Gln Ser Ala Cys Thr Leu Gln Ser Glu Thr His Pro Pro Leu Thr Trp 
1 5 10 15 

Gln Lys Cys Ser Ser Gly Gly Thr Cys Thr Gln Gln Thr Gly Ser Val 
20 25 30 

Val Ile Asp Ala Asn Trp Arg Trp Thr His Ala Thr Asn Ser Ser Thr 
35 40 45 

Asn Cys Tyr Asp Gly Asn Thr Trp Ser Ser Thr Leu Cys Pro Asp Asn 
50 55 60 

Glu Thr Cys Ala Lys Asn Cys Cys Leu Asp Gly Ala Ala Tyr Ala Ser 
65 70 75 80 

Thr Tyr Gly Val Thr Thr Ser Gly Asn Ser Leu Ser Ile Gly Phe Val 
85 90 95 

Thr Gln Ser Ala Gln Lys Asn Val Gly Ala Arg Leu Tyr Leu Met Ala 
100 105 110 

Ser Asp Thr Thr Tyr Gln Glu Phe Thr Leu Leu Gly Asn Glu Phe Ser 
115 120 125 

Phe Asp Val Asp Val Ser Gln Leu Pro Cys Gly Leu Asn Gly Ala Leu 
130 135 140 

Tyr Phe Val Ser Met Asp Ala Asp Gly Gly Val Ser Lys Tyr Pro Thr 
145 150 155 160 

Asn Thr Ala Gly Ala Lys Tyr Gly Thr Gly Tyr Cys Asp Ser Gln Cys 
165 170 175 

Pro Arg Asp Leu Lys Phe Ile Asn Gly Gln Ala Asn Val Glu Gly Trp 
180 185 190 

Glu Pro Ser Ser Asn Asn Ala Asn Thr Gly Ile Gly Gly His Gly Ser 
195 200 205 

Cys Cys Ser Glu Met Asp Ile Trp Glu Ala Asn Ser Ile Ser Glu Ala 
210 215 220 

Leu Thr Pro His Pro Cys Thr Thr Val Gly Gln Glu Ile Cys Glu Gly 
225 230 235 240 

Asp Gly Cys Gly Gly Thr Tyr Ser Asp Asn Arg Tyr Gly Gly Thr Cys 
245 250 255 

Asp Pro Asp Gly Cys Asp Trp Asn Pro Tyr Arg Leu Gly Asn Thr Ser 
260 265 270 

Phe Tyr Gly Pro Gly Ser Ser Phe Thr Leu Asp Thr Thr Lys Lys Leu 
275 280 285 

Thr Val Val Thr Gln Phe Glu Thr Ser Gly Ala Ile Asn Arg Tyr Tyr 
290 295 300 

Val Gln Asn Gly Val Thr Phe Gln Gln Pro Asn Ala Glu Leu Gly Ser 
305 310 315 320 

Tyr Ser Gly Asn Glu Leu Asn Asp Asp Tyr Cys Thr Ala Glu Glu Ala 
325 330 335 

Glu Phe Gly Gly Ser Ser Phe Ser Asp Lys Gly Gly Leu Thr Gln Phe 
340 345 350 

Lys Lys Ala Thr Ser Gly Gly Met Val Leu Val Met Ser Leu Trp Asp 
355 360 365 

Asp Tyr Tyr Ala Asn Met Leu Trp Leu Asp Ser Thr Tyr Pro Thr Asn 
370 375 380 

Glu Thr Ser Ser Thr Pro Gly Ala Val Arg Gly Ser Cys Ser Thr Ser 
385 390 395 400 

Ser Gly Val Pro Ala Gln Val Glu Ser Gln Ser Pro Asn Ala Lys Val 
405 410 415 

Thr Phe Ser Asn Ile Lys Phe Gly Pro Ile Gly Ser Thr Gly Asn 
420 425 430 

<210> 7 
<211> 1077 
<212> DNA 

<213> Acidothermus cellulolyticus 
<400> 7 

gcgggcggcg gctattggca cacgagcggc cgggagatcc tggacgcgaa caacgtgccg 60 

gtacggatcg ccggcatcaa ctggtttggg ttcgaaacct gcaattacgt cgtgcacggt 120 

ctctggtcac gcgactaccg cagcatgctc gaccagataa agtcgctcgg ctacaacaca 180 

atccggctgc cgtactctga cgacattctc aagccgggca ccatgccgaa cagcatcaat 240 

ttttaccaga tgaatcagga cctgcagggt ctgacgtcct tgcaggtcat ggacaaaatc 300 

gtcgcgtacg ccggtcagat cggcctgcgc atcattcttg accgccaccg accggattgc 360 

agcgggcagt cggcgctgtg gtacacgagc agcgtctcgg aggctacgtg gatttccgac 420 

ctgcaagcgc tggcgcagcg ctacaaggga aacccgacgg tcgtcggctt tgacttgcac 480 

aacgagccgc atgacccggc ctgctggggc tgcggcgatc cgagcatcga ctggcgattg 540 

gccgccgagc gggccggaaa cgccgtgctc tcggtgaatc cgaacctgct cattttcgtc 600 

gaaggtgtgc agagctacaa cggagactcc tactggtggg gcggcaacct gcaaggagcc 660 

ggccagtacc cggtcgtgct gaacgtgccg aaccgcctgg tgtactcggc gcacgactac 720 

gcgacgagcg tctacccgca gacgtggttc agcgatccga ccttccccaa caacatgccc 780 

ggcatctgga acaagaactg gggatacctc ttcaatcaga acattgcacc ggtatggctg 840 

ggcgaattcg gtacgacact gcaatccacg accgaccaga cgtggctgaa gacgctcgtc 900 

cagtacctac ggccgaccgc gcaatacggt gcggacagct tccagtggac cttctggtcc 960 

tggaaccccg attccggcga cacaggagga attctcaagg atgactggca gacggtcgac 1020 

acagtaaaag acggctatct cgcgccgatc aagtcgtcga ttttcgatcc tgtcggc 1077 

<210> 8 
<211> 359 
<212> PRT 

<213> Acidothermus cellulolyticus 
<400> 8 

Ala Gly Gly Gly Tyr Trp His Thr Ser Gly Arg Glu Ile Leu Asp Ala 
1 5 10 15 

Asn Asn Val Pro Val Arg Ile Ala Gly Ile Asn Trp Phe Gly Phe Glu 
20 25 30 

Thr Cys Asn Tyr Val Val His Gly Leu Trp Ser Arg Asp Tyr Arg Ser 
35 40 45 

Met Leu Asp Gln Ile Lys Ser Leu Gly Tyr Asn Thr Ile Arg Leu Pro 
50 55 60 

Tyr Ser Asp Asp Ile Leu Lys Pro Gly Thr Met Pro Asn Ser Ile Asn 
65 70 75 80 

Phe Tyr Gln Met Asn Gln Asp Leu Gln Gly Leu Thr Ser Leu Gln Val 
85 90 95 

Met Asp Lys Ile Val Ala Tyr Ala Gly Gln Ile Gly Leu Arg Ile Ile 
100 105 110 

Leu Asp Arg His Arg Pro Asp Cys Ser Gly Gln Ser Ala Leu Trp Tyr 
115 120 125 

Thr Ser Ser Val Ser Glu Ala Thr Trp Ile Ser Asp Leu Gln Ala Leu 
130 135 140 



Ala Gln Arg Tyr Lys Gly Asn Pro Thr Val Val Gly Phe Asp Leu His 
145 150 155 160 

Asn Glu Pro His Asp Pro Ala Cys Trp Gly Cys Gly Asp Pro Ser Ile 
165 170 175 

Asp Trp Arg Leu Ala Ala Glu Arg Ala Gly Asn Ala Val Leu Ser Val 
180 185 190 

Asn Pro Asn Leu Leu Ile Phe Val Glu Gly Val Gln Ser Tyr Asn Gly 
195 200 205 

Asp Ser Tyr Trp Trp Gly Gly Asn Leu Gln Gly Ala Gly Gln Tyr Pro 
210 215 220 

Val Val Leu Asn Val Pro Asn Arg Leu Val Tyr Ser Ala His Asp Tyr 
225 230 235 240 

Ala Thr Ser Val Tyr Pro Gln Thr Trp Phe Ser Asp Pro Thr Phe Pro 
245 250 255 

Asn Asn Met Pro Gly Ile Trp Asn Lys Asn Trp Gly Tyr Leu Phe Asn 
260 265 270 

Gln Asn Ile Ala Pro Val Trp Leu Gly Glu Phe Gly Thr Thr Leu Gln 
275 280 285 

Ser Thr Thr Asp Gln Thr Trp Leu Lys Thr Leu Val Gln Tyr Leu Arg 
290 295 300 

Pro Thr Ala Gln Tyr Gly Ala Asp Ser Phe Gln Trp Thr Phe Trp Ser 
305 310 315 320 

Trp Asn Pro Asp Ser Gly Asp Thr Gly Gly Ile Leu Lys Asp Asp Trp 
325 330 335 

Gln Thr Val Asp Thr Val Lys Asp Gly Tyr Leu Ala Pro Ile Lys Ser 
340 345 350 

Ser Ile Phe Asp Pro Val Gly 
355 



<210> 9 

<211> 1914 

<212> DNA 

<213> Acidothermus cellulolyticus 



<400> 9 

aacgacccgt acatccagcg gttcctcacg atgtacaaca agattcacga cccagcgaac 60 

ggctacttca gcccgcaggg aattccctac cactcggtag aaacgctcat cgttgaggca 120 

ccggactacg ggcacgagac aacttcggag gcgtacagct tctggctctg gctcgaagcg 180 

acgtacggcg cagtgaccgg caactggacg ccgttcaaca acgcctggac gacgatggaa 240 

acgtacatga tcccgcagca cgcggaccag ccgaacaacg cgtcgtacaa ccccaacagc 300 

ccggcgtcgt acgctccgga agagccgctg cccagcatgt acccggttgc catcgacagc 360 

agcgtgccgg ttgggcacga cccgctcgcc gccgaattgc agtcgacgta cggcactccg 420 

gacatttacg gcatgcactg gctggccgac gttgacaaca tctacggata cggcgacagc 480 

cccggcggtg gttgcgaact cggtccttcc gctaagggcg tctcctacat caacacattc 540 

cagcgcggct cgcaggagtc cgtctgggag acggtcaccc agccgacgtg cgacaacggc 600 

aagtacggtg gggcgcacgg ctacgtcgac ctgttcatcc agggttcgac gccgccgcag 660 

tggaagtaca ccgatgcccc ggacgccgac gcccgtgccg tccaggctgc gtactgggcc 720 
tacacctg 



